Sequential codes, lossless compression of individual sequences, and Kolmogorov complexity
نویسندگان
چکیده
A general class of sequential codes for lossless compression of individual sequences on a finite alphabet is defined, including many types of codes that one would want to implement. The principal requirement for membership in the class is that the encoding and decoding operations be performable on a computer. The OPTA function for the class of codes is then considered, which is the function that assigns to each individual sequence the infimum of the rates at which the sequence can be compressed over this class of sequential codes. Two results about the OPTA function are obtained: 1) it is shown that any sequential code in the class compresses some individual sequence at a rate strictly greater than the rate for that sequence given by the OPTA function; and 2) it is shown that the OPTA function takes a value strictly greater than that of the Kolmogorov complexity rate function for some individual sequences. Zndex TermsLossless compression, individual sequences, sequential codes, Kolmogorov complexity, Lempel-Ziv algorithm.
منابع مشابه
Two-Dimensional Kolmogorov Complexity and Validation of the Coding Theorem Method by Compressibility
The question of natural measures of complexity for objects other than strings and sequences, in particular suited for 2-dimensional objects, is an open important problem in complexity science. Here we provide a measure based upon the concept of Algorithmic Probability that elegantly connects to Kolmogorov complexity that provides a natural approach to n-dimensional algorithmic complexity by usi...
متن کاملLossless Compression and Complexity of Chaotic Sequences
We investigate the complexity of short symbolic sequences of chaotic dynamical systems by using lossless compression algorithms. In particular, we study Non-Sequential Recursive Pair Substitution (NSRPS), a lossless compression algorithm first proposed by W. Ebeling et al. [Math. Biosc. 52, 1980] and Jiménez-Montaño et al. [arXiv:cond-mat/0204134, 2002]) which was subsequently shown to be optim...
متن کاملReconciling Data Compression and Kolmogorov Complexity
While data compression and Kolmogorov complexity are both about effective coding of words, the two settings differ in the following respect. A compression algorithm or compressor, for short, has to map a word to a unique code for this word in one shot, whereas with the standard notions of Kolmogorov complexity a word has many different codes and the minimum code for a given word cannot be found...
متن کاملCompression Complexity
The Kolmogorov complexity of x, denoted C(x), is the length of the shortest program that generates x. For such a simple definition, Kolmogorov complexity has a rich and deep theory, as well as applications to a wide variety of topics including learning theory, complexity lower bounds and SAT algorithms. Kolmogorov complexity typically focuses on decompression, going from the compressed program ...
متن کاملA Guaranteed Compression Scheme for Repetitive DNA Sequences
We present a text compression scheme dedicated to DNA sequences. The exponential growing of the number of sequences creates a real need of analyzing tools. A specific need emerges for methods that perform sequences classification upon various criteria, one of which is the sequence repetitiveness. A good lossless compression scheme is able to distinguish between “random” and “significative” repe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Trans. Information Theory
دوره 42 شماره
صفحات -
تاریخ انتشار 1996